Some Observations on the Use of the Extended Kalman Filter as a Recurrent Network Learning Algorithm
نویسنده
چکیده
The extended Kalman lter (EKF) can be used as an on-line algorithm to determine the weights in a recurrent network given target outputs as it runs. This involves forming an augmented network state vector consisting of all unit activities and weights. This report notes some relationships between the EKF as applied to recurrent net learning and some simpler techniques that are more widely used. In particular, it is shown that making certain simpliications to the EKF gives rise to an algorithm essentially identical to the real-time recurrent learning (RTRL) algorithm. That is, the resulting algorithm both maintains the RTRL data structure and prescribes identical weight changes. In addition, because the EKF also involves adjusting unit activity in the network, it provides a principled generalization of the useful \teacher forcing" technique. Very preliminary experiments on simple nite-state Boolean tasks indicate that the EKF works well for these, generally giving substantial speed-up in number of time steps required for convergence to a solution when compared with the behavior of simpler on-line gradient algorithms like RTRL or truncated backpropagation through time (BPTT). The computational requirements of the EKF for this application are also compared with those of RTRL and BPTT. For a network having n units its storage requirements scale as O(n 4), which exceeds the O(n 3) storage required by RTRL. Interestingly, however, the computation can be organized so that the number of arithmetic operations per time step scales as O(n 4), which is no worse (except by a constant factor) than RTRL. Finally, some speculation is ooered that the EKF and related algorithms may help provide a basis for both gaining a deeper understanding of existing recurrent network learning techniques and creating more computationally attractive algorithms that nevertheless retain the advantages of the EKF, most notably its incrementality and superior convergence speed.
منابع مشابه
Sensorless Speed Control of Double Star Induction Machine With Five Level DTC Exploiting Neural Network and Extended Kalman Filter
This article presents a sensorless five level DTC control based on neural networks using Extended Kalman Filter (EKF) applied to Double Star Induction Machine (DSIM). The application of the DTC control brings a very interesting solution to the problems of robustness and dynamics. However, this control has some drawbacks such as the uncontrolled of the switching frequency and the strong ripple t...
متن کاملTraining Recurrent Networks Using the Extended Kalman Filter
The extended Kalman lter (EKF) can be used as an on-line algorithm to determine the weights in a recurrent network given target outputs as it runs. This paper notes some relationships between the EKF as applied to recurrent net learning and some simpler techniques that are more widely used. In particular, making certain simpliications to the EKF gives rise to an algorithm essentially identical ...
متن کاملTuning of Extended Kalman Filter using Self-adaptive Differential Evolution Algorithm for Sensorless Permanent Magnet Synchronous Motor Drive
In this paper, a novel method based on a combination of Extended Kalman Filter (EKF) with Self-adaptive Differential Evolution (SaDE) algorithm to estimate rotor position, speed and machine states for a Permanent Magnet Synchronous Motor (PMSM) is proposed. In the proposed method, as a first step SaDE algorithm is used to tune the noise covariance matrices of state noise and measurement noise i...
متن کاملDesign of Instrumentation Sensor Networks for Non-Linear Dynamic Processes Using Extended Kalman Filter
This paper presents a methodology for design of instrumentation sensor networks in non-linear chemical plants. The method utilizes a robust extended Kalman filter approach to provide an efficient dynamic data reconciliation. A weighted objective function has been introduced to enable the designer to incorporate each individual process variable with its own operational importance. To enhance...
متن کاملRotated Unscented Kalman Filter for Two State Nonlinear Systems
In the several past years, Extended Kalman Filter (EKF) and Unscented Kalman Filter (UKF) havebecame basic algorithm for state-variables and parameters estimation of discrete nonlinear systems.The UKF has consistently outperformed for estimation. Sometimes least estimation error doesn't yieldwith UKF for the most nonlinear systems. In this paper, we use a new approach for a two variablestate no...
متن کاملNonlinear system identification with recurrent neural networks and dead-zone Kalman filter algorithm
Compared to normal learning algorithms, for example backpropagation, Kalman filter-based algorithm has some better properties, such as faster convergence, although this algorithm is more complex and sensitive to the nature of noises. In this paper, extended Kalman filter is applied to train state-space recurrent neural networks for nonlinear system identification. In order to improve robustness...
متن کامل